Efficient Maintenance of Materialized Top-k Views

نویسندگان

  • Ke Yi
  • Hai Yu
  • Jun Yang
  • Gangqiang Xia
  • Yuguo Chen
چکیده

We tackle the problem of maintaining materialized topk views in this paper. Top-k queries, including MIN and MAX as important special cases, occur frequently in common database workloads. A top-k view can be materialized to improve query performance, but in general it is not selfmaintainable unless it contains all tuples in the base table. Deletions and updates on the base table may cause tuples to leave the top-k view, resulting in expensive queries over the base table to “refill” the view. In this paper, we propose an algorithm that reduces the frequency of refills by maintaining a top-k′ view instead of a top-k view, where k′ changes at runtime between k and some kmax ≥ k. We show that in most practical cases, our algorithm can reduce the expected amortized cost of refill queries to O(1) while still keeping the view small. The optimal value of kmax depends on the update pattern and the costs of querying the base table and updating the view. Compared with the simple approach of maintaining either the top-k view itself or a copy of the base table, our algorithm can provide orders-ofmagnitude improvements in performance with appropriate kmax values. We show how to choose kmax dynamically to adapt to the actual system workload and performance at runtime, without requiring accurate prior knowledge.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Utilization of Materialized Views in a Data Warehouse

View Materialization is an effective method to increase query efficiency in a data warehouse. However, one encounters the problem of space insufficiency if all possible views are materialized in advance. Reducing query time by means of selecting a proper set of materialized views with a lower cost is crucial for efficient data warehousing. In addition, the costs of data warehouse creation, quer...

متن کامل

Functional Dependency Driven Auxiliary Relation Selection for Materialized Views Maintenance

In a data warehouse system, maintaining materialized views can speed up query processing. These views need to be maintained in response to updates in the base relations. This is often done for reasons of data currency, using incremental techniques rather than re-computing the view from scratch. However, when the data source changes, the views in the warehouse can become inconsistent with the ba...

متن کامل

Incremental Maintenance of Object-Oriented Views in a Warehousing Environment

Data warehousing is an approach to data integration in which integrated information is stored in a data warehouse for direct querying and analysis. To provide fast access, a data warehouse stores materialized views defined over data from its data sources. As a result, a data warehouse needs to be maintained to keep its contents consistent with the contents of its data sources. Incremental maint...

متن کامل

Reducing the Size of Auxiliary Data Needed to Support Materialized View Maintenance in a Data Warehouse Environment

A data warehouse consists of a set of materialized views that contain derived data from several data sources. Materialized views are beneficial because they allow efficient retrieval of summary data. However, materialized views need to be refreshed periodically in order to avoid staleness. During a materialized view refresh only changes to the base tables are transmitted from the data sources t...

متن کامل

Incremental Maintenance Of Materialized XQuery Views by Maged F . El - Sayed A Dissertation Submitted

Keeping views fresh by maintaining the consistency between materialized views and their base data in the presence of base updates is a critical problem for many applications, including data warehousing and data integration. While heavily studied for traditional databases, the maintenance of XML views remains largely unexplored. Maintaining XML views is complex due to the richness of the XML dat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003